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Covariant quantum measurements may not be optimal 
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Abstract. Quantum particles, such as spins, can be used for communicating spatial 

directions to observers who share no common coordinate frame. We show that if the 

emitter's signals are the orbit of a group, then the optimal detection method may not 

^1 be a covariant measurement (contrary to widespread belief). It may be advantageous for 



the receiver to use a different group and an indirect estimation method: first, an ordinary 



t^^ ' measurement supplies redundant numerical parameters; the latter are then used for a 

o : 

nonlinear optimal identification of the signal. 

O 

:^ 



S . 1. Indiscrete quantum information^ 



Information theory usually deals with the transmission of a sequence of discrete sym- 
bols, such as and 1. Even if the information to be transmitted is of continuous nature, 
/\ ' such as the position of a particle, it can be represented with arbitrary accuracy by a string 

b : 

of bits. However, there are situations where information cannot be encoded in such a way. 
For example, the emitter (conventionally called Alice) wants to indicate to the receiver 
(Bob) a direction in space. If they have a common coordinate system to which they can 
refer, or if they can create one by observing distant fixed stars, Alice simply communicates 
to Bob the components of a unit vector n along that direction, or its spherical coordinates 
6 and 0. But if no common coordinate system has been established, all she can do is to 
send a real physical object, such as a gyroscope, whose orientation is deemed stable. 

In the quantum world, the role of the gyroscope is played by a system with large spin. 
Earlier works [1-4] considered the use of spins for transmitting a single direction. The 

^Note to printer: this is not a typo. The term we use is indiscrete (meaning not discrete). Do not 
confuse that with the word indiscreet. 



simplest method [0 is to send these spins polarized along the direction that one wishes 
to indicate. This, however, is not the most efficient procedure: when two spins are trans- 
mitted, a higher accuracy is achieved by preparing them with opposite polarizations 0. 
If there are more than two spins, optimal results are obtained with entangled states |^, ^. 
The fidelity of the transmission is usually defined as 

F=(cos2(x/2)) = (l + (cosx))/2, (1) 

where x is the angle between the true n and the direction indicated by Bob's measurement. 
The physical meaning of F is that the infidelity, 

l-F=(sin2(x/2)), (2) 

is the mean square error of the measurement, if the error is defined as sin(x/2) ||^. The 
experimenter's aim, minimizing the mean square error, is the same as maximizing fidelity. 
We can of course define "error" in a different way, and then fidelity becomes a different 
function of x and optimization leads to different results. With the definition in Eq. (2), 
it can be shown [^ ^ that for a large number N of spins, the infidelity asymptotically 
tends to 

1 - F = 5.783/N^ = l.U6/d, (3) 

where d is the dimension of the subspace of Hilbert space that is effectively used for the 
transmission. 

A more difficult problem is the transmission of a complete Cartesian frame, if a single 
quantum messenger is available. In an earlier publication [^], we showed how a hydrogen 
atom (formally, a spinless particle in a Coulomb potential) can transmit a complete frame. 
We assumed the hydrogen atom to be in a Rydberg state (an energy eigenstate is needed 
to ensure the stability of the transmission). The n-th energy level of that atom has 
degeneracy d = n^ because the total angular momentum may take values j = 0, ■ ■ ■ , n — 1, 
and for each one of them m = —j, ■ ■ ■ ,j. A similar calculation was done by Bagan, Baig, 
and Munoz-Tapia (hereafter BBM) 0, who were able to reach much higher values of j 
and to prove that the asymptotic behavior was 

1 - F ^ 1/Vd. (4) 



Here the infidelity 1 — F is the sum of the mean square errors for three orthogonal axes. 

There is an essential difference between our work and that of BBM. We considered a 
single system (a hydrogen atom in a Rydberg state), while BBM took N spins, and one 
irreducible representation for each value j of the total angular momentum. The maximum 
value is jmax = N/2, and then the mathematics are the same as for our Rydberg state, 
with jmax = n — 1, as explained above. However, if there are N spins that can be sent 
independently, there is a better method. Alice can use half of them to indicate her z axis, 
and the other half for her x axis. The two directions found by Bob may not be exactly 
perpendicular, because separate transmissions have independent errors due to limited 
angular resolution. Some adjustment will be needed to obtain Bob's best estimates for 
the z and x axes, before he can infer from them his guess of Alice's y direction. Even 
without this adjustment, this method is far more accurate, especially if A^ is large. From 
Eq. (^, the mean square error for each one of the z and x axes is 5.783/(A^/2)^ = 23.13/A^^, 
rather than 4/3 A^ which is the result with the method used by BBM [0]. 

Similar results hold even for low values of A^. For example, if Alice has four spins at 
her disposal, she can do better in this way than with the BBM method: she sends two 
spins with opposite polarizations along her z axis (the Gisin-Popescu method [^) and two 
with opposite polarizations along her x axis. The infidelity for each one of these axes is 
0.21132 (this can still be improved by forcing orthogonality on Bob's axes, as explained 
in Sect. 4). On the other hand, with a hydrogen atom p and jmax = 2, if we optimize 
two axes, the mean square error per axis is 0.23865. Why is there such a discrepancy? 

2. Covariant measurements are not always optimal 

In all the works that were mentioned above, and in many other similar ones, it was 
assumed that Holevo's method of covariant measurements |^ gave optimal results. That 
method considers the case where Alice's signals are the orbit of a group Q, with elements g. 
Namely, if 1^4) is one of the signals, the others are \Ag) = U{g)\A), where U{g) is a 
unitary representation of the group element g. The only problem was to find optimal 
quantum states for Alice's signals and Bob's detectors. Originally, Holevo considered only 
irreducible representations. It is known now that in some cases reducible representations 
are preferable |^, Q . One then never needs to use more than one copy of each irreducible 
representation in the reducible one. For example, if 1^4^) has four spins as in the above 
example, this state can be written by using each one of j = 0,1,2 only once, as shown 



explicitly in the next section. 

We now turn our attention to Bob. The mathematical representation of his apparatus 
is a positive operator valued measure (POVM) |^, namely a resolution of identity by a 
set of positive operators: 

J:Eh = 1, (5) 

h 

where the label h indicates the outcome of Bob's experiment. This is true for any type 
of measurement, provided that the labels h are kept "raw" and not subjected to further 
classical processing into a new set of labels, as explained in Sect. 4 and 5. In the case of 
covariant measurements, the labels h run over all the elements of the group Q (with a 
suitable adjustment of the notation in the case of continuous groups). Then the probability 
that Bob's apparatus indicates group element h when Alice sent a signal \Ag) is 

P{h\g) = {A,\E,\A,). (6) 

The method of covariant measurements further assumes that E^ can be written as 

Eh = \Bh){Bh\, (7) 

where 

\Bh)=U{h)\B). (8) 

Here, \B) is a fiducial vector for Bob (which has to be optimized) and U{h) is a repre- 
sentation (possibly a direct sum of irreducible representations) of the same group Q that 
Alice is using. 

All this seems quite reasonable (and this indeed usually works well) but, as the above 
example of four spins shows, this may not be the optimal method. In that example, 
Alice's signals 1^4^), for all possible positions of her axes, are 5*0(3) rotations of a fiducial 
state \A) with j = 0,1,2 (see next section for details). On the other hand Bob uses 
two separate POVMs, each one testing only two of the four spins. Each one of these 
POVMs also involves S0{3), but with j = and 1 only. (Strictly speaking, the relevant 
mathematical structure is 5*2 ® 5*2, where 5*2 is the quotient SO{3)/SO{2), namely the 
two-dimensional sphere which is not a group. We shall ignore this technical point and 
informally call it a group, to avoid unnecessarily cumbersome terminology.) 



3. Equivalent irreducible representations 

Let us examine carefully the meaning and construction of equivalent irreducible repre- 
sentations. Unitary equivalence is not equivalence from the point of view of physics [T^ . 



A simple example is a particle of spin |, whose state space has four dimensions and is 
unitarily equivalent to that of a pair of spin ^ particles. In atomic physics, unitarily 
equivalent representation naturally arise when we consider different couplings of the var- 
ious spins, and we use Clebsch-Gordan coefficients in order to construct new states in a 
systematic way. These "equivalent" unitary representations actually correspond to quite 
different states. For example, if we have three spins and we wish to construct states of 
total spin |, we may couple two of the spins into a singlet, so as to get a doublet: 

|0)®(|01)-|10))/V2 and |1) ® (|01) - |10))/V2, (9) 

where |0) and |1) denote the eigenstates of a^, as usual. The rotations of this doublet 
generate an irreducible representation of SU{2). 

We can also generate other, equivalent, irreducible representations by starting with 
different pairs of spins to make a singlet. These equivalent representations have of course 
different physical meanings. If we used quantum numbers for indicating internal symme- 
tries, they would have different quantum numbers. 

It was shown in ||^, Q that the use of more than one equivalent representation does not 
improve the fidelity of the transmission. In these articles, the choice of that representation 
was irrelevant (of course Alice and Bob had to use the same one). However, in some cases, 
that choice may be imposed on us. 

As a simple example, consider a given state |001) of three spins. We want to split it 
into a spin | component and a single spin | component. This can easily be done, but the 
spin I component will not be of the type represented by Eq. (9), or by any permutation 
of the three spins in Eq. (9). To see that, we note that spin | states are symmetric under 
permutations of the particles. Therefore we project |001) on 

||,i) = (|001) + |010) + |100))/V3. (10) 

It follows that the J = | part of |001) is 

||,i)(|,||001) = (|001) + |010) + |100))/3. (11) 



What remains of |001), namely 

|001) - (|001) + |010) + |100))/3 = (2|001) - |010) - |100))/3, (12) 

is the spin | part. This is not the direct product of a singlet and a doublet as in Eq. (^. 
Still that state generates a perfectly legitimate irreducible representation, with j = |. (Of 
course, had we started from a different state, such as |010), we would have ended with a 
different basis for the irreducible representation with j = |.) 

We have a similar construction, but slightly more complicated, for Alice's signal having 
two opposite spins oriented along z and two others along x, as proposed at the end of 
Sect. 1. It should be clear that a pair of signals (or any combination of signals) still are 
one signal. Ahce's signal thus is, with the same notations as above 

\A) = |01) ® (|0) + |1)) ® (|0) - |l))/2 = (10100) + 10110) - 10101) - |0111))/2. (13) 

To find the parts of \A) with j = 0, 1, 2, we proceed as we did for the case of three spins 
(we shall not show the explicit calculations, which are quite lengthy, because they are not 
necessary for the sequel). 

Two points should be emphasized. In earlier works |0, 01; it was not necessary to 
specify the actual construction of the irreducible representations that were used. Their 
choice was arbitrary and irrelevant. It just had to be the same for Alice and Bob. Now, 
the situation is different: these representations are uniquely defined by Alice's signal in 
Eq. ([T3|). The choice of that particular signal was suggested by the Gisin-Popescu method 
for two spins [0. We do not claim that it is the optimal signal for two pairs of spins. To 
actually investigate optimization, Alice's signal should be taken as general as possible, 
namely a sum of 16 terms, 

|A) =ao|0000) + --- + ai5|llll), (14) 

and we would have to determine the coefficients a„, subject to normalization J2 l^nP = 1- 
However, this optimization is a long shot beyond the scope of the present article. Here, 
we only want to show that covariant measurements are not always optimal. 



4. Contravariant quantum measurements 

We now come to Bob's measurement method. As explained in Sect. 1, Bob examines 
separately the spins sent by Alice to indicate her z axis, and those indicating her x axis. 
(The argument below applies to any number of spins, not just two spins for each axis.) In 
his coordinate frame. Bob thus gets two sets of polar angles, Qz'^z and Qx'^x respectively, 
from which he has to infer the Euler angles ?/'6'0 that transform Alice's Cartesian frame 
into his frame. If Bob's measurements were perfect, the relations between these angles 
would be given by equating the Cartesian components of Bob's results for Alice's z and x 



axes with the corresponding columns of the orthogonal transformation matrix |[TT| . This 
gives 
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These are four independent equations (owing to normalization) for three unknowns. If 
Bob's experimental data were exact, a simple solution would be to obtain from Eq. (pTS]) 



Q = Q. 



and 



^ = {n/2) - 0, 



(17) 



The fidelity of this result, namely for finding the direction of a single axis by using any 
number of spins, is discussed in [^ Q where it is shown that, asymtotically, (1 — -F) oc N~'^. 
Once 6 is known, (p can be obtained from the third line of ([T6|): 

sin = cos 6x/ sin 9z, (18) 



where use was made of the result in Eq. ([T7|) . Now there is a difficulty: if N is finite, 
Bob's estimates are not perfectly accurate and the right hand side of (18) may be larger 
than 1. In general, it is preferable to solve the four equations (|15|) and (|T6|) simultaneously 
and to seek a best fit for the three unknowns. The accuracy of this best fit is of course 
better than the one given by Eqs. (|T7D and (18), where one of the four original equations 



was ignored. A simple geometric construction of the solution is as follows: first, find the 
direction perpendicular to the estimated z and x axes; this direction is the best estimate 
for the y axis, and therefore for the zx plane. Then, in that plane, the angle between 
the estimated z and x axes (given by Qz'^z and Qx'^x respectively) is adjusted so that they 
become exactly perpendicular. Detailed calculations are under progress (we hope they 
will appear in a future publication). 

5. The dihedral group 

A clearer understanding of contravariant measurements may be gained by using a finite 
group. As a concrete example, let us consider six directions, defined by the polar angles 
Q = 45° or 135°, and = or ±120°. Alice wishes to indicate one of these directions to 
Bob. Now, these directions are the orbit of the dihedral group D^ with six elements: E 
(the identity). A, B, C (rotations by 180° around the symmetry axes (p = and (p = ±120° 
in the xy plane), and D, F (rotations by ±120° around the z axis). Here, we are using the 
same notations as Wigner |jl2|. This group has a one-dimensional representation where 
all the elements are 1, another where E, D, F are 1, while A, B, C are —1, and there 



is a two-dimensional representation, explicitly given in []12[. From the characters of these 



three representations, it is possible to find the contents of any other, reducible one. 

Suppose that Alice has a single particle of spin |, and she wants to indicate to Bob 
which one of the six directions she has chosen. Obviously, she orients her spin along that 
direction, so that there are six input states, 

p = (1 ± n . o-)/2, (19) 

where n = (sin 6' cos 0, sin 6' sin 0, cos 6'). Likewise Bob has six POVM elements 

Em = {l + m- (t)/6. (20) 

Note that J2 Em = 1- Then the probability of Bob getting result m is Alice's input is n 
is 

P(m|n) =tr(pE„) = (l±n-m)/6. (21) 

Note that the probability of getting the correct result is always 1/3. 

We must now specify a criterion for the fidelity of the transmission. A simple one is 
to give a score 1 is Bob guesses the correct result, and for all incorrect results. It could 

8 



also be argued that some results are more incorrect than others, just as large errors x iii 
Eq. (1) are more heavily penalized than small error angles. Here, for a group of order 6, 
we could assume that elements belonging to the same class are less wrong than those 
belonging to different classes of the group, and incur a lesser penalty. However, we shall 
just assume that all wrong results are equally worthless. Therefore the best that can be 
achieved with one spin is fidelity F = 1/3. 

Suppose now that Alice sends several spins. Rather than using individual measure- 
ments and classical statistics. Bob may perform joint measurements on all these spins. 
For example, if there are two spins, their state belongs to the rotation group represen- 
tations with j = 0, 1. However, that group is too rich: we are interested only in the 
D3 group which is a subgroup of SO (3). Obviously j = corresponds to the symmetric 
one-dimensional representation of D^ (all the elements are 1). As for j = 1, we have to 
find the characters of all the rotations that correspond to elements of D^. This is very 
easy, because one may use for 50(3) the real orthogonal representation, and then, owing 



to Euler's theorem |jTl|, the characters (that is, the traces of the rotation matrices) are 
equal to 1 + 2cos^, where ^ is the rotation angle. We thus find that the character of 
E is 3, those of A, B and C are —1, and those of D and F vanish. This means that 
the triplet state involves the one-dimension representation with alternating signs and the 
two-dimensional representation. 

Therefore a pair of spin | particles generates all the representations of D3, each one 
once. Taking more spins will not produce any new irreducible representation. If Bob is 
restricted to the use of covariant measurements, the maximal fidelity that can be achieved 
is 2/3 (detailed calculations are given in an Appendix). 

A simple method which gives better results is the following. Alice sends N spins, all 
aligned in the direction she wants to indicate, as in Ref. W^. This is an angular momentum 



coherent state |T3[ for spin j = N/2: 



n.J\ij)=j\ij). (22) 

(This is surely not the optimal strategy. In the present paper, we are not seeking optimal- 
ity. We only want to show that some methods give a better fidelity than a straightforward 
covariant measurement.) Bob then performs the covariant measurement for a particle of 
spin j. His POVM elements are coherent states as above, with directions uniformly dis- 



9 



tributed over the two-dimensional sphere. The overlap of two such states is |T^ 



cos^^- (x/2) = cos^^ (x/2), (23) 

where x is the angle between the true and estimated directions. 

Once Bob has found a result Ocf) (this is what we call the "raw" result), he infers 
(guesses) that the true answer for Alice's signal is the direction n closest to dcj). It is, 
as in the preceding example, the best fit for the answer, knowing the approximate value 
given by 9 and (j). As there are finite angles between the six directions n that Alice can 
use, it follows from (23) that the probability of error decreases exponentially with A^. It 
is plausible that a truly optimal method would also have such an exponential accuracy, 
but with a larger coefficient for A^ in the exponent. 

6. Concluding remarks and apologies 

Due to the pressure of a deadline, we did not attempt to find the optimal strategy 
in the two examples given above. In both cases, the same pattern emerges. Bob first 
executes a POVM which uses a group that is not the one for Alice's signals. This POVM 
gives redundant raw data to Bob, from which he infers, by a classical statistical analysis, 
the best estimate for identifying the signal. This final best estimate is nonlinear and it 
cannot he obtained directly by a POVM of rank one, as in Eq. (|^). Indeed, if it could, 
then it would have to be a covariant POVM, and we have just shown that this is not the 
best method. Explicitly, the POVM elements that give the best guess are Eg = J^Eh, 
where the sum runs over all the raw outcomes h that lead to the same guess g. 

Note that there is no contradiction with Gleason's theorem |14] because the proof of 



the latter refers only to the outcome of the measuring process, without a further best fit 
or other classical statistical analysis. We hope that further research on this problem will 
clarify the missing details and give a complete prescription for the optimal procedure. 
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Appendix. Dihedral group signals with one or two spins 

In this Appendix we analyze covariant measurements for the detection of signals be- 
longing to the D^ group. If only one particle of spin ^ is sent by Alice, its state is given 
as usual by 

/ cos(^/2) \ 

where 6 and are the angles that correspond to group element g. Bob's fiducial state is 

\B) = \2e)IV?>. (A 2) 

(The factor -^/S comes from the order of the group divided by the number of dimen- 
sions |13[-) Then 

Y^U{g)\B){B\U\g) = X (A3) 

is Bob's POVM. The fidelity, which is defined in this problem as the probability of a 
correct result, is 

F=\{2e\B)\' = \. (A 4) 

Suppose now that Alice has two spins. She sends them both in state |2g), and Bob 
tests them separately. The probability that he gets twice the correct answer is |. The 
probability that he gets the correct answer once, together with one wrong answer, is |. 
In the latter case, faced with an ambiguous result. Bob will make a random choice. Then 
the final probability for a correct guess is | + | = |, exactly as for a single signal! 

If Alice sent more than two spins in this way and Bob tested them separately, the 
result, given by a multinomial distribution, would slowly improve. As shown in Sect. 5, 
Alice and Bob can do much better by using entangled signals. Let us examine how well 
they can do if they have unlimited experimental skill, but are restricted to the use of 
covariant measurements. 

Alice prepares a signal state within the orbit of group Q: 

\Ag) = ao\Qg)+ai\lg) + a2\2g), (A 5) 
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where \0g) corresponds to the trivial representation (all the |0c,) are equal), |lg) to the 



alternating one, and \2g) is still given by Eq. (|A 1|) . The coefficients a^ are normalized: 

|«oP + |ai|^ + 102^ = 1- (A 6) 

Bob's fiducial vector now is 



\B) = ^1/6 \0e) + Vl/6 \Ie) + Vl/3 \2e), (A 7) 

so that Eq. ( |A 3| ) is still valid. 

The probability for a correct result thus is 

F = \{B\Ae)\^ = \{ao + aO/Ve + as/Vsp. (A 8) 



This is a quadratic expression for the coefficients am, subject to the normalization ( |A 6| ). 
Its maximum is easily found to be |, when 

ao = ai = 1/2 and as = 1/^2. (A 9) 

We see that this optimal covariant measurement only improves the fidelity from | to |. A 
contravariant measurement such as the one described in Sect. 4 gives better results: the 
infidelity (1 — F) decreases exponentially with the number of spins, owing to Eq. (^). 
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